JCC-H: Adding Join Crossing Correlations with Skew to TPC-H
نویسندگان
چکیده
We introduce JCC-H, a drop-in replacement for the data and query generator of TPC-H, that introduces Join-Crossing-Correlations (JCC) and skew into its dataset and query workload. These correlations are carefully designed such that the filter predicates on table columns in the existing TPC-H queries now suddenly can have effects on the value, frequencyand join-fan-out-distributions, experienced by operators in the query plan. The query generator of JCC-H is able to generate parameter bindings for the 22 query templates in two different equivalence classes: query templates that receive “normal” parameters do not experience skew and behave very similar to default TPC-H queries. Query templates expanded with the “skewed” parameters, though, experience strong join-crossing-correlations and skew in filter, aggregation and join operations. In this paper we discuss the goals of JCC-H, its detailed design, as well as show initial experiments on both a single-server and MPP database system, that confirm that our design goals were largely met. In all, JCC-H provides a convenient way for any system that is already testing with TPC-H to examine how the system can handle skew and correlations, so we hope the community can use it to make progress on issues like skew mitigation and detection and exploitation of join-crossing-correlations in query optimizers and data storage.
منابع مشابه
On Powers of Some Graph Operations
Let $G*H$ be the product $*$ of $G$ and $H$. In this paper we determine the rth power of the graph $G*H$ in terms of $G^r, H^r$ and $G^r*H^r$, when $*$ is the join, Cartesian, symmetric difference, disjunctive, composition, skew and corona product. Then we solve the equation $(G*H)^r=G^r*H^r$. We also compute the Wiener index and Wiener polarity index of the skew product.
متن کاملTPC-H Analyzed: Hidden Messages and Lessons Learned from an Influential Benchmark
The TPC-D benchmark was developed almost 20 years ago, and even though its current existence as TPC-H could be considered superseded by TPC-DS, one can still learn from it. We focus on the technical level, summarizing the challenges posed by the TPC-H workload as we now understand them, which we call “choke points”. We identify 28 different such choke points, grouped into six categories: Aggreg...
متن کاملComputer generation of Hadamard matrices
We develop a computer code that uses elegant bit-manipulation techniques for matrix multiplication and thus facilitates exhaustive generation of Skew-Hadamard matrices. Hadamard matrices are useful in spectroscopic applications (Hadamard transform spectroscopy) and in balanced chemical designs. Application of our code yields several Skew-Hadamard matrices up to order 100 X 100, although the com...
متن کاملResiduated Skew Lattice with a Operation
In this paper, we define hedge operation on a residuated skew lattice and investigate some its properties. We get relationships between some special sets as dense, nilpotent, idempotent, regular elements sets and their hedges. By examples, we show that hedge of a dense element is not a dense and hedge of a regular element is not a regular. Also hedge of a nilpotent element is a nilpotent and h...
متن کاملModes, Modals, and Barycentric Algebras: a Brief Survey and an Additivity Theorem
Modes are idempotent and entropic algebras. Modals are both join semilattices and modes, where the mode structure distributes over the join. Barycentric algebras are equipped with binary operations from the open unit interval, satisfying idempotence, skew-commutativity, and skew-associativity. The article aims to give a brief survey of these structures and some of their applications. Special at...
متن کامل